Spam Mail Filtering through Dynamically Updating URL Statistics
نویسندگان
چکیده
This paper presents a unique spam mail filtering technique based on a deep analysis of statistics on URL’s included in various e-mails gathered from a laboratory in a university for about six months. Since the proposed mail filtering technique searches only URL’s in mail, the overhead introduced by searching all mail contents or black list utilized by many other mail filtering algorithms is significantly reduced. In addition, the proposed filtering technique dynamically updates URL list through client feedback, and the bias possibly introduced by selecting bad training mail set can be eliminated as the filtering process is progressed.
منابع مشابه
On the properties of spam-advertised URL addresses
The main purpose of most spam e-mail messages distributed on Internet today is to entice recipients into visiting World Wide Web pages that are advertised through spam. In essence, e-mail spamming is a campaign that advertises URL addresses at a massive scale and at minimum cost for the advertisers and those advertised. Nevertheless, the characteristics of URL addresses and of web sites adverti...
متن کاملEstablishing Trust Between Mail Servers to Improve Spam Filtering
This paper proposes a new way to improve spam filtering based on the establishment and maintenance of trust between mail domains. An architecture is presented where each mail domain has an associated trust manager that dynamically records trust measures pertaining to other domains. Trust by one mail domain in another is influenced by direct experience as well as recommendations issued by collab...
متن کاملSpamato Reloaded Trust, Authentication and More in a Collaborative Spam Filter System
Spamato is a collaborative spam filter system implemented in Java. It is designed as a framework to support any number and kind of spam filters. The initial version features an URL Filter, which extracts URLs from incoming mail messages and calculates a fingerprint based on these URLs. This fingerprint is compared to a central database. If its fingerprint is known as spam, the mail message is c...
متن کاملTwo Approaches on Implementation of CBR and CRM Technologies to the Spam Filtering Problem
Recently the number of undesirable messages coming to e-mail has strongly increased. As spam has changeable character the anti-spam systems should be trainable and dynamical. The machine learning technology is successfully applied in a filtration of e-mail from undesirable messages for a long time. In this paper it is offered to apply Case Based Reasoning technology to a spam filtering problem....
متن کاملApplications of Text Clustering Based on Semantic Body for Chinese Spam Filtering
The effect of spam filtering method based on statistics is not good enough in filtering the new-type spam with synonymous substitution and camouflage, because the method based on statistics ignores the semantic relation between words in the text, and only judges from the word itself. So, a method of spam filtering based on the semantic body is proposed in this paper. The method adopts lexical c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005